Web mining for Web image retrieval
Abstract
The popularity of digital images is increasing rapidly, driven by improving digital imaging technologies and by the easy availability that the Internet provides. Finding the images a user intends on the Internet, however, is non-trivial, mainly because web images are usually not annotated with semantic descriptors. In this paper, we present an effective approach to image retrieval from the Internet using web mining, together with a prototype system, which can also serve as a web image search engine. One key idea of the approach is to extract text from web pages to describe the images semantically. This text description is then combined with low-level image features in the image similarity assessment. Another main contribution of this work is that we apply data mining to the log of users' feedback to improve image retrieval performance in three respects. First, it improves the accuracy of the document space model, the image representation obtained from the web pages, by removing clutter and irrelevant text. Second, it constructs the user space model of users' representation of images, which is then combined with the document space model to eliminate the mismatch between the page author's expression and the user's understanding and expectation. Third, it discovers the relationship between low-level and high-level features, which is extremely useful for assigning the weights of the low-level features in similarity assessment.
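The abstract does not give the similarity formula, so the following is only a minimal sketch of how a text description might be combined with low-level features under per-feature weights. The weighted linear combination, the use of cosine similarity for text and histogram intersection for features, and all names (`term_vector`, `combined_similarity`, the `"color"` feature, the weight values) are assumptions made for illustration, not the authors' method.

```python
import math
from collections import Counter


def term_vector(text):
    """Bag-of-words term counts for the text found around an image."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(w * b.get(t, 0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def histogram_intersection(h1, h2):
    """Similarity in [0, 1] of two histograms that each sum to 1."""
    return sum(min(x, y) for x, y in zip(h1, h2))


def combined_similarity(query, image, weights):
    """Weighted average of text similarity and low-level feature similarities.

    `weights` maps "text" and each feature name to a non-negative weight.
    In the paper these weights are informed by mining user feedback; here
    they are simply supplied by the caller (an assumption).
    """
    score = weights["text"] * cosine(query["terms"], image["terms"])
    total = weights["text"]
    for name, hist in query["features"].items():
        score += weights[name] * histogram_intersection(hist, image["features"][name])
        total += weights[name]
    return score / total if total else 0.0


# Hypothetical data: one query and two candidate web images.
query = {"terms": term_vector("red sunset beach"), "features": {"color": [0.5, 0.5]}}
beach = {"terms": term_vector("sunset over beach"), "features": {"color": [0.5, 0.5]}}
city = {"terms": term_vector("city skyline night"), "features": {"color": [0.9, 0.1]}}
weights = {"text": 0.6, "color": 0.4}

ranked = sorted(
    [("beach", beach), ("city", city)],
    key=lambda item: combined_similarity(query, item[1], weights),
    reverse=True,
)
```

Normalizing by the sum of the weights keeps the score in [0, 1], so changing the weights changes only the relative influence of text and low-level features, not the scale of the ranking score.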
Related resources
Retrieval of health-domain image information on the Web from the perspective of medical sciences experts: a qualitative study
Introduction: The medical image, as a source of non-textual information, has an important role in the field of medicine. Since quality of life is directly related to health, using this type of information is effective in improving the practice of health professionals. This study aimed to survey medical image retrieval on the Web from the perspective of experts in medical sciences. M...
Semantic-Based Web Mining For Image Retrieval Using Enhanced Support Vector Machine
This paper deals with semantic-based web mining for image retrieval by means of an enhanced Support Vector Machine (SVM). Generally, conventional Content-Based Image Retrieval (CBIR) systems fail to satisfy users' requirements because of the 'semantic gap' between the derived features and the user's query. A large number of existing approaches show certain predetermined semantic cate...
Image Clustering Technique for Web Search Engine Retrieval System
In a web search engine, clustering is an efficient way of deriving information from raw data, and K-means is a basic method for it. Although K-means is easy to implement and understand, it has serious drawbacks, so we turn to other techniques for the filtering process, such as the greedy global algorithm. Algorithms of this type also work as text mining techniques over the web and also cluster the ...
A Survey on Web Research for Data Mining
Web mining is the application of data mining techniques to extract knowledge from web data, including web documents, hyperlinks between documents, usage logs of web sites, etc. The process of extracting useful information from the contents of web documents is data mining. Content data is the collection of facts a web page is designed to contain. It may consist of text, images, audio, video, or s...
Web mining: a survey in the fuzzy framework
This article provides a survey of the available literature on fuzzy Web mining. The different aspects of Web mining, like clustering, association rule mining, navigation, personalization, Semantic Web, information retrieval, text and image mining are considered under the existing taxonomy. The role of fuzzy sets in handling the different types of uncertainties/impreciseness is highlighted. A hybr...
Journal title:
- JASIST
Volume 52, issue
Pages -
Publication date 2001